IN THE CLAIMS: 



Please amend the claims as follows: 

1 . (Currently amended) A process for creating an ensemble filter for selecting 
documents, comprising: 

identifying a sot of docum e nts for training; 

identifying a first set of documents from_a_said training set of documents; 

identifying a first profile corresponding to said first set of documents; 

identifying a second set of documents and a r e maind e r third set of documents from said 
training set of documents using said first profil e; 

identifying at least on e a fourth set of documents from said r e maind e r third set of 
documents; 

identifying at least on e r e mainder a second p rofile corresponding to e ach of said fourth 
identifi e d s e ts of docum e nts from said remaind e r set of documents; 

creating a first sub - filter using filter based upon said first profile; 

creating a second filter based upon said second profile at l e ast one r e maind e r sub filt e r 
using at l e ast on e of s aid remainder profil e s; and 

combining said first filter sub - filt e r with at least one remaind e r sub filter said second 
filter t o create an ensemble filter ; and 

storing said ensemble filter in a computer readable medium, said ensemble filter being 
accessible by computer readable program code for filtering documents . 

2. (Previously presented) A process, as in claim 1, further comprising: 
clustering said training set of documents to identify said first set of documents. 

3. (Previously presented) A process, as in claim 1, further comprising: 
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clustering said training set of documents and selecting said largest cluster to identify said 
first set of documents. 

4. (Currently amended) A process, as in claim 1, further comprising: 

cascading said first sub filt e r filter and at l e ast on e r e maind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 

5. (Currently amended) A process, as in claim 1, further comprising: 

multiplexing said first sub filt e r filter with at least on e r e maind e r sub - filt e r with said 
second filter to create at least part of said ensemble filter. 

6. (Currently amended) A process, as in claim 2, further comprising: 

cascading said first sub filt e r filter and at l e ast one r e maind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 

7. (Currently amended) A process, as in claim 3, further comprising: 

cascading said first sub filt e r filter and at l e ast on e r e maind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 

8. (Currently amended) A process, as in claim 2, further comprising: 

multiplexing said first sub filt e r filter with at l e ast on e remaind e r sub filt e r with said 
second filter to create at least part of said ensemble filter. 

9. (Currently amended) A process, as in claim 3, further comprising: 

multiplexing said first sub - filt e r filter with at l e ast on e remaind e r sub filt e r with said 
second filter to create at least part of said ensemble filter. 
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10. (Currently amended) A process for selecting documents from a stream of 
documents, comprising: 

identifying a s e t of docum e nts for training; 

identifying a first set of documents from_a_said training set of documents; 

identifying a first profile corresponding to said first set of documents; 

identifying a second set of documents and a remaind e r third set of documents from said 
training set of documents using said first profil e; 

identifying at l e ast on e a fourth set of documents from said r e maind e r third set of 
documents; 

identifying at l e ast on e r e maind e r a second p rofile corresponding to each of said fourth 
id e ntifi e d s e ts of docum e nts from said r e maind e r set of documents; 

creating a first sub filt e r using filter based upon said first profile; 

creating a second filter based upon said second profile; at l e ast on e r e maind e r sub filt e r 
using at l e ast on e of said remainder profil e s; and 

combining said first filter sub filt e r with at l e ast on e r e maind e r sub filter said second 
filter to create an ensemble filte r; and 

passing said stream of documents through said ensemble filter. 

11. (Previously presented) A process, as in claim 10, further comprising: 
clustering said training set of documents to identify said first set of documents. 

12. (Previously presented) A process, as in claim 10, further comprising: 

clustering said training set of documents and selecting said largest cluster to identify said 
first set of documents. 

13. (Currently amended) A process, as in claim 10, further comprising: 
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cascading said first sub filt e r filter and at least on e remaind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 



14. (Currently amended) A process, as in claim 10, further comprising: 

multiplexing said first sub filt e r filter with at l e ast one remaind e r sub - filt e r with said 
second filter to create at least part of said ensemble filter. 

15. (Currently amended) A process, as in claim 11, further comprising: 

cascading said first sub filt e r filter and at l e ast on e remaind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 

16. (Currently amended) A process, as in claim 12, further comprising: 

cascading said first sub filt e r filter and at l e ast on e r e mainder sub filt e r with said second 
filter to create at least part of said ensemble filter. 

17. (Currently amended) A process, as in claim 11, further comprising: 

multiplexing said first sub - filt e r filter with at least on e r e maind e r sub filt e r with said 
second filter to create at least part of said ensemble filter. 

18. (Currently amended) A process, as in claim 12, further comprising: 

multiplexing said first sub filt e r filter with at l e ast one r e maind e r s ub filt e r with said 
second filter to create at least part of said ensemble filter. 

19. (Currently amended) A process for selecting documents from a database of 
documents, comprising: 

id e ntifying a set of docum e nts for training; 

identifying a first set of documents from_a_sakl training set of documents; 
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identifying a first profile corresponding to said first set of documents; 

identifying a second set of documents and a r e maind e r third set of documents from said 
training set of documents using said first profil e; 

identifying at l e ast on e a fourth set of documents from said r e mainder third set of 
documents; 

identifying at least one r e maind e r a second p rofile corresponding to each of said fourth 
id e ntifi e d s e ts of documents from said remainder set of documents; 

creating a first sub - filt e r using filter based upon said first profile; 

creating a second filter based upon said second profile; at least on e r e maind e r sub filt e r 
using at l e ast on e of said r e maind e r profiles; and 

combining said first filter sub filt e r with at least on e r e maind e r sub filter said second 
filter to create an ensemble filter ; and 

applying said ensemble filter to said database to select documents. 

20. (Previously presented) A process, as in claim 19, further comprising: 
clustering said training set of documents to identify said first set of documents. 

21. (Previously presented) A process, as in claim 19, further comprising: 

clustering said training set of documents and selecting said largest cluster to identify said 
first set of documents. 

22. (Currently amended) A process, as in claim 19, further comprising: 

cascading said first sub filt e r filter and at l e ast on e r e mainder sub filt e r with said second 
filter to create at least part of said ensemble filter. 

23. (Currently amended) A process, as in claim 19, further comprising: 
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multiplexing said first sub filt e r filter with at least on e r e mainder sub filt e r with said 
second filter to create at least part of said ensemble filter. 

24. (Currently amended) A process, as in claim 20, further comprising: 

cascading said first sub filt e r filter and at l e ast on e r e maind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 

25. (Currently amended) A process, as in claim 21 , further comprising: 

cascading said first sub filt e r filter and at least on e r e maind e r sub filt e r with said second 
filter to create at least part of said ensemble filter. 

26. (Currently amended) A process, as in claim 20, further comprising: 

multiplexing said first sub filt e r filter with at l e ast on e r e maind e r sub filt e r with said 
second filter to create at least part of said ensemble filter. 

27. (Currently amended) A process, as in claim 21, further comprising: 

multiplexing said first sub filt e r filter with at l e ast one r e mainder sub filt e r with said 
second filter to create at least part of said ensemble filter. 

28. (New) An apparatus for generating an ensemble filter, comprising: 
a processing system; and 

a memory coupled to the processing system, wherein the processor is configured to: 

identify a first set of documents from a training set of documents; 

identify a first profile corresponding to said first set of documents; 

identify a second set of documents and a third set of documents from said training 
set of documents; 
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identify a fourth set of documents from said third set of documents; 

identify a second profile corresponding to said fourth set of documents; 

create a first filter based upon said first profile; 

create a second filter based upon said second profile; and 

combine said first filter with said second filter to create an ensemble filter; 

store said ensemble filter in a computer readable medium, said ensemble filter 
being accessible by computer readable program code for filtering documents. 



29. (New) An article of manufacture comprising a computer readable medium having 
executable program code embodied therein for generating an ensemble filter, wherein the 
executable program code is adapted to cause the processing system to: 

identify a first set of documents from a training set of documents; 

identify a first profile corresponding to said first set of documents; 

identify a second set of documents and a third set of documents from said training 
set of documents; 

identify a fourth set of documents from said third set of documents; 

identify a second profile corresponding to said fourth set of documents; 

create a first filter based upon said first profile; 

create a second filter based upon said second profile; and 

combine said first filter with said second filter to create an ensemble filter; 

store said ensemble filter in a computer readable medium, said ensemble filter 
being accessible by computer readable program code for filtering documents. 
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